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We address the problem of decoding sparse quantum error correction codes. For Pauli channels, 
this task can be accomplished by a version of the belief propagation algorithm used for decoding 
sparse classical codes. Quantum codes pose two new challenges however. Firstly, their Tanner graph 
unavoidably contain small loops which typically undermines the performance of belief propagation. 
Secondly, sparse quantum codes are by definition highly degenerate. The standard belief propagation 
algorithm does not exploit this feature, but rather it is impaired by it. We propose heuristic methods 
to improve belief propagation decoding, specifically targeted at these two problems. While our 
results exhibit a clear improvement due to the proposed heuristic methods, they also indicate that 
the main source of errors in the quantum coding scheme remains in the decoding. 

PACS numbers: 



I. INTRODUCTION 

Quantum information science studies the information 
transmission and processing capabilities of quantum me- 
chanical systems. It is now well established that quan- 
tum mechanical devices can in principle outperform clas- 
sical ones in these two settings. The physical realization 
of such schemes, however, relies on our ability to cope 
with imperfect noisy systems. The theory of quantum 
error correction (QEC) and its fault-tolerant implemen- 
tation 0, [U, H2, M IM [U, El shows that this chal- 
lenge is surmountable. The vast majority of studies of 
QEC and fault-tolerance has focused on small size block 
codes encoding one or very few qubits, and concatena- 
tion thereof. Noticeable exceptions are topological codes 
(that have a vanishing rate), convolutional codes 
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3ll. |32H , low density parity check (LDPC) codes 



I lSl, l22|, |42j , and turbo-codes [35| . 

The main difficulty with small block QEC codes is that 
in order to achieve high error suppression, each logical 
qubit needs to be encoded in a very large number of 
physical qubits (the encodings are of low rates). This 
overhead poses an extra technical challenge for the design 
of practical quantum information processors. In classical 
coding theory, it is well known that smaller overheads 
can be achieved by encoding multiple bits in a single 
large block. In fact, random codes on n bits typically 
achieve the Shannon capacity in terms of maximum rate 
at vanishing error probability as n tends to infinity. The 
drawback, however, is that in general these codes cannot 
be decoded efficiently, i.e., computing the optimal recov- 
ery given an error syndrome is an NP-complete problem. 

Two approaches can be pursued to overcome the de- 



* Present address: Departement de Physique, Universite de Sher- 
brooke, QC, Canada. 

^Electronic address: David. Poulin@UShcrbrooke.ca 
t Electronic address: ychung@mail.smu.edu 



coding problem. Algebraic codes (e.g. Reed-Solomon 
codes [33, Bose-Chaudhuri-Hocquenghem codes frj Il8l]) 
are designed to have large minimal distances and efficient 
minimal distance decoders. Their construction relies on 
the structure of finite-field algebra, and they come an in- 
finite variety of rates and minimal distances. Algebraic 
codes have been investigated in the quantum setting, e.g. 
@, i, M SI- As argued e.g. in [ll| however, due to 
their strict focus on the minimal distance, these codes are 
bound to fall short of the Shannon limit. They are on the 
other hand uncqualed for channels with burst errors, and 
have therefore found numerous commercial applications. 

The other approach is probabilistic coding (e.g., low 
density parity check [LDPC] codes, turbo-codes) which 
differs from algebraic coding in three important aspects. 
Firstly, the codes are chosen at random from a speci- 
fied ensemble, and one is interested in the typical be- 
havior over the ensemble. Secondly, minimum distance 
decoding is typically a hard problem on these code en- 
sembles. This leads to the third important distinc- 
tion which is a shift of focus from minimal distance 
to the average performance of the code ensemble un- 
der (sub-optimal) polynomial-time decoding. For some 
common classical channels, state-of-the-art probabilistic 
codes are near-capacity achieving. The randomness in- 
volved in the code design yields good codes while the 
structure imposed on the code ensemble is sufficient 
to ensure good decoding performances. Probabilistic 
codes have also been investi gate d in the quantum con- 
text @, M M M M Hi Mm ■ 

Probabilistic codes are decoded by means of a belief 
propagation (BP) algorithm (also known as sum-product, 
message passing, Vitcrbi algorithm, Bahl-Cockc-Jenilek- 
Raviv algorithm depending on context). BP is a highly 
parallel general-purpose algorithm to solve (or approxi- 
mate the solution to) inference problems involving large 
number of random variables located at the vertices of 
a graph, where the edges of the graph encode depen- 
dency between the random variables |l|, [H, |4(| . When 
the graph is a tree, BP produces the exact solution to the 
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inference problem in a time bounded by the tree's depth. 
More importantly, BP has proven to be a good heuris- 
tic method in cases where it is not known to converge, 
and particularly when the typical size of the loops in the 
graph is large. Recently, a generalization of BP was pro- 
posed to solve inference problems involving a large num- 
ber of quantum systems fl7l l25j . It was also shown that 
decoding a stabilizer QEC code on any memorylcss quan- 
tum channel reduces to a quantum inference problem on 
a (generally loopy) graph. 

Sparse codes (a.k.a. LDPC) are designed to accom- 
plish a good performance under BP decoding. They are 
obtained by imposing random linear constraints on code- 
words, with each constraint involving a small number of 
bits and similarly each bit being involved in a small num- 
ber of constraints. The constraints can be represented 
by a Tanner graph (c.f. Fig. QJ. As a consequence of the 
sparseness of the constraints, the graph typically contains 
no or very few small loops. Decoded with a BP algo- 
rithm, sparse codes are amongst the best known classical 
error correction codes. For this reason, it is desirable to 
generalize them to the quantum setting. 

A method due to Calderbank, Shor, and Steane 0, |43| 
(CSS) enables the construction of QEC codes from any 
pair of dual classical codes. This technique has been 
used extensively to design small block codes, and recently 
to design "hybrid" (small block flip, large block phase) 
QEC codes tailored for particular channels |T9|. One 
might expect the CSS construction to leverage the power 
of sparse codes to QEC, but attempts in that direction 
have proven quite difficult 0, [2^]. The main obstacle 
in the design of sparse CSS codes is that sparse classi- 
cal codes — being good error-correcting codes — do not 
typically have sparse duals. Thus, randomly generated 
sparse codes are inappropriate and more sophisticated 
generating techniques are required for their use in CSS 
constructions. 

The duality condition — and, more generally, the com- 
mutation conditions imposed on any stabilizer codes, not 
just those obtained from the CSS construction — also 
poses two immediate complications for the decoding of 
sparse QEC codes. On one hand, it implies that the 
graph representing a QEC code must have weight-4 loops. 
This places BP decoding on a slippery slope. On the 
other hand, when a classical code has a sparse dual code, 
it has by definition low weight errors. Fortunately, er- 
rors that lie in the dual code act trivially on the encoded 
quantum information. Any pair of errors that are re- 
lated by an element of the dual code cannot and need 
not be distinguished: they are both corrected by the 
same operation. These are called "degenerate errors" 
and by definition the sparse quantum codes have many 
degenerate low-weight errors. The degeneracy is a fea- 
ture that generally improves the performance of QEC 
codes [HI, 0, El|. Unfortunately, BP does not exploit 
the degeneracy of the code and even worse, it is typically 
impaired by degeneracy. 

The primary purpose of this paper is to describe why 



degeneracy compromises BP decoding. Then, in an at- 
tempt to alleviate this problem, we propose heuristic 
techniques to partially overcome the aforementioned ob- 
tacles. Our proposed techniques yields significant im- 
provements over standard BP decoding for the cases con- 
sidered here. We note, however, that despite these im- 
provements, our methods are in need of further develop- 
ment for achieving optimal performance in broad range 
of applications. 

The rest of the paper is organized as follows. In the 
next section, we introduce the necessary background on 
QEC and set up some notation. Sec. IIIII provides a de- 
tailed description of the BP algorithm as used for the 
decoding of sparse quantum codes. The following sec- 
tion describes the concept of degeneracy in QEC, and 
presents a simple model to explain why it should affect 
the performances of BP decoding. In Sec. [V] we present 
some heuristic methods to improve BP decoding. Then, 
we present our numerical results in Sec. I VII 



II. NOTATION AND BACKGROUND 

A. Stabilizer codes 

A QEC code on n qubits is a subspacc C of the 
Hilbcrt space (C 2 )®". It can be specified as the +1 
common cigenspacc of some set of commuting operators 
Si, S2, ■ ■ ■ S m that generate under multiplication the so- 
called stabilizer group S = (S c ). Of particular interest 
are codes whose stabilizer generators are (up to a phase) 
?i-fold tensor products of 2 x 2 Pauli matrices 
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and we will refer to such operators simply as Pauli oper- 
ators. They form a group V n called the Pauli group. The 
elements of V n either commute or anti-commute, and for 
all E,F G V n , we define 



E-F = 



1 if EF = FE 
-1 if EF = -FE. 



(1) 



For E, F G V\ , this operation is summarized in the fol- 
lowing table. 
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I X Y Z 
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1-1-1 1 



For n-qubit Pauli operators E = E\ (g) E<z ® . . . g> E n and 



F 



) F„ , we have 



E ■ F 



fc=i 



Fi. 
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When S has m = n — k independent generators. 
C ~ {C 2 )® k will encode k logical qubits. Operators that 
commute with the stabilizers elements have the property 
of mapping the code space to itself, and are therefore 
called logical operators. The logical operators also form 
a group N(S), the normalizer of S in V n , which contains 
S as a normal subgroup. The logical group is defined as 
C = N(S)/S and coincides with Vk- 

For notational convenience, we will henceforth omit 
the tensor product symbol when describing an element 
of V n . For instance, the element X ® I ® I ® Y G V4 
will simply be denoted by XIIY . Alternatively, we use 
subscripts to identify the qubits that are acted upon by 
non-trivial Pauli matrices. For the above example, we 
would write X{Y^. 

The Clifford group on n qubits N(V n ) is composed of 
unitary matrices that map Pauli operators to Pauli oper- 
ators — it is the normalizer of V n in U(2 n ). One may suc- 
cinctly describe a QEC code by a Clifford matrix U act- 
ing on n qubits as follows. The stabilizer generators are 
S c = U ZJJ^ for c = 1, . . . , m. These operators clearly 
commute and are multiplicativcly independent. From [/, 
one can define the logical operators Xj = UX n -k+jW 
and Zj = UZn-k+jU^ for j = l,...k. These operators 
commute with elements of S and obey XyZy = (— 
and C ~ (Xj,Zj). With a slight abuse of notation, we 
will henceforth use C to denote N(S)/S and its repre- 
sentation given above. The code is then the set of all 
states of the form £/(|0)® m g> |0)) where \<j)) G {C 2 ) m . 
It is also convenient to define a set of "pure errors" by 
T c = UX C U' for c = l,...,m. These operators com- 
mute with each other and with the logical operators, 
and obey the commutations S c ■ T c > = (— l) Sc - c ' . The 
set {S c ,T c ,Xj,Zj} is a canonical generating set for V n - 



B. Tanner graph 

A QEC code can be represented by a decorated Tanner 
graph. This is a bipartite graph G = (V, E) with vertices 
V = QUC where the subset of vertices Q represent the n 
qubits, and the other subset C represent the m stabilizer 
generators, or checks. The graph has an edge (q, c) G E 
if and only if check c acts non-trivially on qubit q, i.e. iff 
the q'th component of S c differs from the identity. Edge 
(q, c) G E is decorated by the gth Pauli matrix of check c, 
see Figured] When checks c and c' both act non-trivially 
on at least two qubits in common, say q and q' , G will 
contain a 4-loop (c, q, d ', q'). 

To avoid the presence of 4-loops, one could make sure 
that no pair of checks c and c' act on more than one 
common qubits. The commutation condition between 
checks then implies that on that common qubit, c and c' 
must have the same Pauli matrix. This argument leads to 
the conclusion that every edge connected to qubit q must 
carry the same decoration, say Z for concreteness. Such 
a code, however, fails to detect the weight-1 error Z q . 
This would not cause any problem if Z q £ 5, but in this 
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Qubits 

FIG. 1: Decorated Tanner graph for the 5-qubit code @, H3) 
with stabilizer generators {XZZXI, IXZZX, XIXZZ, 
ZXIXZ}. 



case, qubit q factors out in the code space, and the code 
states are of the form \/j,} <g> \0) q for some e (C 2 )" -1 
that satisfy additional constraints. We thus arrive at the 
conclusion that the Tanner graphs of QEC codes must 
unavoidably contain 4-loops. 



C. Sparse quantum codes 



An edge in the Tanner graph is said to have qubit- 
degrec i, if it is connected to a qubit node of degree 
(weight) i. Similarly, it has check-degree i, if it is con- 
nected to a check node of degree i. A sparse quantum 
code has a Tanner graph with qubit- and check-degree 
bounded by some constant independent of the number of 
encoded qubits. Since the checks must obey the strict 
commutation relations, it has proven difficult to gener- 
ate them pscudo-randomly as it is done for the classical 
sparse codes. 

More precisely, a degree distribution 7(2;) = ^\ 7iX l_1 
is a polynomial with nonnegative real coefficients satis- 
fying 7(1) = 1. Here, 7, denotes the fraction of edges 
in a graph for nodes of degree i. Let Xi and pi denote 
the fraction of edges of qubit-degree and check-degree i, 
respectively. Then, the degree distribution for the Tan- 
ner graph is the pair (A, p), where A = J2i>i ^i xl1 an d 
p = J2i>i Pi x% l ■ The degree distribution (A, p) defines a 

code ensemble of rate R = 1 — J Q p(x)dx/ X(x)dx. We 
note that, in the classical theory of LDPC codes, a typical 
code in that ensemble decoded with the simple BP de- 
coder on the binary erasure channel with erasure proba- 
bility S shows successful performance if 5\(1— p(l— x)) < 
x, for x G {0,6) [i|[22l,|3|. 
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III. BELIEF PROPAGATION DECODING 

In this article, the error model that we consider are 
Pauli channels that have the form 

£(p) = E p( e ) e p e ^ ( 3 ) 

EET> n 

where p(E) > and J2eP( e ) = 1- ^ * s c l car from 
Eq. ([3]) that the phase of Pauli operators are irrelevant, 
so the sum should only be carried over the effective Pauli 
group V n /{H, But to keep the notation simple, we 
will henceforth implicitly assume that V n is quotiented by 
its center. A memoryless Pauli channel is one for which 
the probability factors as p(E) — YY q =iPq( E q) f° r au 
E = Ei E2 ■ ■ ■ E n G V n ■ A particularly relevant example 
is the depolarizing channel for which p q (I) = 1 — e and 
p q {X) = p q {Y) = P q (Z) = e/3 for all q, and for some 
depolarizing strength < e < 1. 

Once the n qubits are prepared in a code state \ip) G C, 
they are sent through the channel and state p = £ (IV'XV'I) 
is obtained by the receiver. To detect the possible errors 
at the receiver, the stabilizer generators S c are measured 
and outcome s = (si, S2, . . . , s rn ) G { — 1, l} m , called the 
error syndrome, is obtained. The state p is a statistical 
mixture of states E\ip) for different E G V n - When the 
error E that corrupted the register commutes with S c , 
the syndrome bit s c takes value +1 because S c E\tp) = 
ES c \ip) — E\ip). When E anti-commutes with S c on the 
other hand, we obtain S c E\ip) = —ES c \ip) = —E\ip), and 
hence, s c = — 1. Thus, the error syndrome provides par- 
tial information about the error that has occurred during 
the transmission. 

Of all the errors E € V n that could have corrupted 
the n qubits, only those that obey a commutation rela- 
tion with the generators of S consistent with the error 
syndrome are to be considered after the measurements 
of the ScS. In other words, the probability distribution 
of the errors conditioned on error syndrome s is 

m 

p(E\s) ! xp(E)l[d SciE . Sc . (4) 

c=l 

One decoding technique consists in identifying the most 
likely error given the syndrome s: 

E*(s) = argmax{p(£;|s)}. (5) 

Eev n 

Since all elements of V n square to the identity (up to an 
irrelevant phase), the information can then be corrected 
by applying E*{s) to the register. We will see in the next 
section however that this is not the optimal decoding 
rule. 

The computationally difficult step in the above error 
correction scheme is the evaluation of E*(s). In fact, this 
problem is NP-complete [IJ. 1 Thus, we will settle for a 



1 This problem is actually equivalent to that of finding the ground 



simpler task that consists of evaluating the qubit-wise 
most-likely error. For E = E\ ■ ■ ■ E n , let us define the 
marginal probabilities 

Pq (E q \s)= J2 p(E!...E n \a). (6) 

E 1 ,...E q _i,E q+ i,...E n 

The qubit-wise most likely error is then 

E*(s) = argmax.{pq{E q \s)}. (7) 

E q eVi 

In general, the marginal optimum E*(s) . . . E*(s) needs 
not coincide with the global optimum E*(s). 

Evaluating E* (s) is also a difficult task because Eq. ^ 
involves a sum over a number of terms exponential in n. 
This is where belief propagation comes in handy. 2 This 
algorithm operates by sending messages along the edges 
of the Tanner graph (c.f. Fig.[T]). Messages from qubit q 
to check c are denoted by m 9 _» c and messages from check 
c to qubit q are denoted m c ^ q . Messages received at and 
sent by qubit q are probability distributions over E q . In 
other words, they are arrays of 4 positive numbers, one 
for each value E q = I, X, Y, and Z. We denote n(q) the 
neighbors of qubit q 7 and define n(c) similarly. 

To initialize the algorithm, each qubit sends out a mes- 
sage to its neighbors equal to its prior probability of error 
m q ^ c (E q ) = p q (E q ) 7 where p q (E q ) is the probability en- 
tering in the definition of the memoryless channel Eq. ([3]). 
Upon reception of these messages, each check sends out 
a message to its neighboring qubits given by 

m c ^ q (E q )K E(\, Sc . £c [J m q ,^ c (E q ,)), (8) 

E q , q 'En(c)\ q 
q'en(c)\q 

where n(c)\q denotes all neighbors of c except q. The 
sum is over all Pauli operators on the neighbors of c but 
with E q held fixed. The proportionality factor can be 
fixed by normalization m c ^ q = 1. Thus, m c _> g is 
a function of the syndrome bit s c associated to check c, 
and the messages m q i^ c received from all neighbors of c, 
except q. 

Upon reception of these messages, each qubit sends out 
a message to its neighboring checks given by 

m q ^ c (E q ) (x p{E q ) m c ,^ q (E q ), (9) 

c'£n( q )\c 

where n(q)\c denotes all neighbors of q except c. Again, 
the proportionality factor can be fixed by normalization. 



state of a spin-glass, which gives rise to a host of connections 
between sparse code theory and statistical physics |30( . 
2 As noted above, Pauli channels, being defined in terms of a prob- 
ability over Vn, make use of a classical BP algorithm. For arbi- 
trary quantum channels, a quantum BP algorithm was described 
in details in l2- r l . 
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Thus, m g _, c is a function of the qubit prior probability 
p q (E q ), and the messages m c i^ q received from all neigh- 
bors of q, except c. 

Equations ([8][9]) define an iterative procedure that is at 
the core of BP. The beliefs b q (E q ) — that are meant to 
represent an approximation to the marginal conditional 
probability p q {E q \s) Eq. © — are computed from the 
m c ^ q messages as follows 

b q (E q ) = Pq (E q ) J] m c _ q (E q ). (10) 

cGri(q) 

The recovery can be chosen as the product of qubit- 
wise maximum-belief Pauli matrices E BP = ®" =1 E q p 
where 

E? p = &rgmax{b q (E q )}. (11) 

When G is a tree, the beliefs converge to the correct 
conditional marginals b q (E q ) = p q (E q \s) in a time equal 
to the tree's depth. Thus, the qubit-wise most likely error 
can be computed efficiently. This leads, for instance, to 
an efficient maximum-likelihood decoding algorithm for 
concatenated quantum block codes [34| . 

When the graph G includes loops, the beliefs will not 
converge to the correct conditional marginals in general. 
Nonetheless BP can be used as a heuristic method to 
approximate the solution to such intrinsically hard prob- 
lems, and in many situations it is empirically observed to 
provide reliable solutions. Apropos examples of success- 
ful loopy BP are provided by the near-capacity achieving 
decoding of classical LDPC and turbo-codes. 

Since loopy BP is not guaranteed to converge within 
any fixed number of iterations, one has to impose a halt- 
ing criterion on the iterative procedure. Here, we choose 
to halt when the correction given by the highest be- 
liefs E BP lead to a trivial error syndrome, i.e., when 
E BP ■ Sj = Sj for all j = 1, ... , m. Since this is not guar- 
anteed to occur after any finite number of iterations, it 
is also necessary to impose a maximum number of itera- 
tions. 



IV. DEGENERACY 

As explained above, the output of the channel p is a 
statistical mixture of states E\ip) for different E E V n - 
Note however that when E' = ES for some operator 
S E S, then E\ip) = E'\tp) by definition of code states. 
Moreover, E and E' have the same error syndrome since 
they differ by an element of S, which by definition com- 
mute with all the generators S c . Hence, errors E and E' 
cannot be distinguished. This is not a concern however 
because any recovery R that reverses the effect of E on 
the code space will also reverse the effect of E' 



In brief, the errors that are related by an element of S 
cannot and need not be distinguished by the error syn- 
drome. This is the main feature of degeneracy in QEC. 

Since {S C ,T C , Xj, Zj} is a canonical generating set 
for Vni we can uniquely expand each error as E = 
s(E)t(E)l(E) where s{E) E S, t(E) E (T c ), and 1(E) E 
(Xj, Zj). Note that 1(E) is the only element of that de- 
composition that can anti-commute with the stabilizer 
generators. Moreover every elements of (T c ) has a dis- 
tinct commutation relation with the S c , and hence a dis- 
tinct syndrome. It follows that t(E) is entirely known 
given the error syndrome s, and it is given by 

t(s) = ]jT c -^. (12) 

c=l 

On the other hand, the value of s(E) is of no interest 
since that element of the decomposition acts trivially on 
C. The only element left is 1(E), and it has conditional 
probability 

p(L\s) cx J2 P ^ ( 13 ) 

E:t(E)=L 

= J2p( E = St(s)L). (14) 

Optimal decoding consists in identifying the most likely 
value for L given s 

L*(s) = argmax{p(L|s)}. (15) 

This differs from the decoder defined at Eq. © because 
p(L\s) is obtained from p(E\s) by quoticnting V n by S. 
In this sense, QEC codes are intrinsically coset codes. 3 

The degeneracy typically improves the performance of 
a code because many errors can be corrected in the same 
way, and need not be distinguished by the error syn- 
drome. This becomes particularly relevant when many 
elements of S have low weight since the degeneracy will 
then relate typical errors. To take advantage of this fea- 
ture, it is important to properly sum the probabilities 
over the cosets: the coset with the most likely error may 
differ from the most likely coset. However, the BP al- 
gorithm presented in the previous section uses Eq. ([5]) 
as a starting point, and so completely ignores the coset 
structure of the code. This will lead to a decline of the 
scheme's performances, and should be particularly pro- 
nounced for sparse codes. 

Beyond these complications, the degeneracy is ex- 
pected to cause extra complications when used with any 
qubit-wise maximum-likelihood decoder such as BP. This 
is due to the presence of symmetries in the code that are 
reflected in the decoder, but broken by the channel. The 
next section aims at illustrating this issue. 



RE\ip) = \ip) =S> RE'\il>) = RES\ip) = RE\ip) = \ip). 



3 In statistical physics terms, decoder Eq. l(5j minimizes the energy 
while Eq. II15I I minimizes the free energy. 
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A. A case study 

We consider a particularly simple QEC code that en- 
codes zero qubits (a single state) into two qubits. It 
has stabilizer generators XX and ZZ, and code space 
C = {(|00) + \11))/V2}. Suppose these qubits are sent 
through a depolarization channel, and that error IX is 
applied. Measurement of the stabilizer generators reveal 
the error syndrome s = (+1,-1). This error can be 
corrected by cither of the operations XI, IX, YZ, or 
ZY. Because of the symmetry of the code however, the 
marginal conditional probabilities Eq. © are identical on 
both qubits, Pi(E\s) = P-2(E\s). Since none of the cor- 
rect recoveries have this symmetry, any decoding scheme 
based on qubit-wise probabilities will fail with this very 
simple code. 

Figure [2] a) shows the beliefs obtained from the BP al- 
gorithm as a function of the number of iterations. Due 
to the symmetry of the code, the beliefs on qubit 1 and 
2 are identical. As we see, the identity is always assigned 
the largest belief, and hence, the BP decoder will rec- 
ommend correcting the error by doing nothing. This is, 
of course, incorrect. In fact, this correction will result 
in a "detected error" because there will remain a non- 
trivial syndrome after applying the recommended recov- 
ery In that case, our BP algorithm would iterate until 
it reaches the maximum number of iterations imposed 
by hand, and would output the correction II knowing 
that it is an incorrect inference. In fact, the decoder can- 
not possibly succeed because it always assigns identical 
beliefs to qubits 1 and 2, but none of the appropriate 
corrections listed above have that symmetry 

The optimal decoder must add the probability of XI 
and IX, and assign the result to either one of them; this 
choice is not important and can be made at random. This 
is not incorporated in the standard BP scheme because 
XI and IX are symmetric in all aspects. To achieve any 
improvement in the BP decoding, it is essential to break 
this symmetry. 



V. HEURISTIC METHODS FOR DEGENERATE 
CODES 

While the example of the previous section may ap- 
pear naive and contrived, it actually captures a univer- 
sal feature of sparse quantum codes: they have many 
degenerate typical errors, and the degenerate errors of 
equal weight arc completely symmetric under BP decod- 
ing. One needs to break this symmetry in order to over- 
come the difficulty explained in the previous section. We 
have investigated a host of different techniques to im- 
prove the performance of sparse quantum codes under 
BP decoding. Here, we present the most successful ones. 

In all cases, the general structure of the BP algorithm 
is unaltered. At each round of iteration, new beliefs are 
obtained and the maximum-belief correction E BP is eval- 
uated. If E BP restores a trivial syndrome, the algorithm 



halts with output E . If, on the other hand, the proce- 
dures reaches a pre-determincd fixed number of iterations 
Tpert without satisfying the halting conditions, a pertur- 
bation will be applied in order to break any symmetry 
that may be responsible for the impasse. The different 
techniques we have explored differ in the choice of the 
symmetry breaking. The presence of small loops in the 
Tanner graph turns out to be crucial because it rapidly 
propagates the symmetry breaking perturbation to all 
the qubits involved in the degenerate error. 

A. Freezing 

The first technique consists of "freezing" one of the 
qubits involved in an unsatisfied check. We first find 
a frustrated check c, i.e., S c ■ E BP ^ s c . We pick a 
random qubit q connected to that check, and freeze its 
prior probability to p q (E q ) = 5e ,i- We let the BP iterate 
with this modified prior for T pert steps. If it halts during 
this period we are done. If c is still frustrated, we restore 
the probability of q and freeze a different qubit q' involved 
in c. If c is not frustrated but the halting condition is not 
satisfied, we find another frustrated check d and freeze 
one of its qubits at random. 

When applied to the above example, the random freez- 
ing technique immediately solves the problem. If for in- 
stance we set P2CE2) = Se 2 .i, we obtain after a single 
iteration bi{E\) = 8e 1 ,x and 62(^2) = fei.Jj hence the 
correction E BP = XI. 



B. Random perturbation 

The second technique consists in randomly perturbing 
the prior probability associated to qubits involved in frus- 
trated checks. We simply identify the frustrated checks 
c, and for each q in n(c) we apply the perturbation (up 
to normalization) 

Pq (I q ) -> Pq (I q ) (16) 

Pq (X q ) - (l + S x ) Pq (X q ) (17) 

Pq (Y q ) - (l + 6 Y ) Pq (Y q ) (18) 

Pq {Z q ) (l + 6 z ) Pq (Z q ) (19) 

where 5x,Sy, and Sz are random variables distributed 
uniformly in the range [0, 5] for some fixed S. The goal 
of this perturbation is to create an asymmetry among 
the qubits that will put an end to the impasse. It is 
clear that the perturbation needs to be random in or- 
der to break the symmetry: if the error probability is 
equally increased on all unsatisfied checks the problem 
would persist. It also seems intuitive to move to a prior 
with strictly stronger error probability because at least 
one of the qubits connected to c definitely has an error. 
This choice is also justified by the fact that for the vast 
majority of decoding errors, the decoder had identified 
the identity operator as the most likely error. In other 
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FIG. 2: Beliefs as a function of number of iterations for a QEC code with stabilizer generators XX and ZZ with syndrome 
(+1, —1)- a) The prior Pq{E q ) is the depolarizing channel with e = 0.1 for both qubits. In that case, E BP = II which yields 
a detected error. On b) and c), a random perturbation of strength 5 = 1 was applied to the prior, breaking the symmetry 
between the two qubits. b) shows the beliefs for qubit 1 and c) the beliefs for qubit 2. In this case, E BP = XI which is an 
appropriate recovery. 



words, we have empirically observed that the decoder is 
naturally too biased towards the identity. 

When applied to the toy model of the previous section, 
a random perturbation produced exactly the effect we 
were hoping for, as long as the perturbation creates a 
sufficiently strong asymmetry between pi(X) andp2(A). 
An example illustrates this point on Fig. [2]b)-c). 



C. Collision 

The collision trick is an attempt to enforce the analogy 
between real decoding instances and the simple model 
presented in Sec. IfV Al ft operates in conjunction with 
other symmetry breaking techniques, ft proceeds by find- 
ing a colliding pair of unsatisfied checks c and c', such 
that both S c ■ E BP ^ s c and S d ■ E BP ^ s c >, and c 
and d share some qubits in common. Typically, c and d 
will have two qubits in common. Under the assumption 
that the non-trivial syndromes s c and s c < have a com- 
mon cause — i.e. that they are due to error on their 
common qubits — the situation reduces exactly to that 
of Sec. IfV Al Thus, we can apply a random perturbation 
to those qubits, or choose to freeze one of them. It is also 
possible to further exploit the labels of the checks on each 
qubits to determine the local degeneracy structure of the 
code. 



VI. RESULTS 

We illustrate the performances of our decoding algo- 
rithms with a bicycle sparse quantum code [1^ used over 
a depolarizing channel. We have applied our decoding al- 
gorithms to a number of other codes; in all cases we have 
found that our heuristic methods yield an improvement 
over the standard BP decoder but the best performances 
were obtained from the bicycle codes. 



Bicycle codes [29| arc CSS codes that make use of cyclic 
matrices. A simple method to create a d x d cyclic bi- 
nary matrix C consists of generating a binary column 
vector A of length d with random entries and defining 
{C)i_j = (A)i + j. A sparse cyclic matrix is obtained when 
the vector A is sparse. 

For a bicycle code with row- weight w, block length 
n and number of checks m, first generate a random 
n/2 x n/2 cyclic matrix C with row weight w/2. (As 
in [2^ |. this matrix can be constructed from a difference 
set, but the construction outline above seems to work 
just as well.) Then, construct a matrix Ho by merging 
the matrices C and C T , i.e., Ho = (C|C T ). This matrix 
is self-dual: 

(HoHjjij = (CCt + CtC) 4J (20) 

= A 1+fc A J+fc + A k+l A k+J = 0. (21) 

k 

Deleting n/2 — m/2 of the rows from Ho yields a self-dual 
matrix H with m/2 rows and n columns. This matrix is 
used in the CSS construction to obtain a code of rate 
"~ m with checks given by 

„ = \YlU z(li)c3 f°r0<c<m/2, , 9 „s 
ln"=i* (Hki for m/2 <c<m. 1 ' 

The commutativity of the checks follows from the self- 
duality of H. 

The construction of bicycle codes has complete free- 
dom in controlling the size and weight of the code. How- 
ever, all the deleted rows of Ho are low- weight codewords 
which arc unlikely to lie in the dual. Hence, while they 
offer good performances under BP decoding on the depo- 
larization channel, bicycle codes most likely have a small 
minimal distance (less or equal to w). 

The most successful methods for improving the itera- 
tive decoding performances are freezing (Sec. IV A[) com- 
bined with the method of colliding checks (Sec. IV C\\ . and 
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FIG. 3: Block error probability vs depolarization strength e 
for three different decoding procedure as explained in the text. 
The code encodes 400 logical qubits in 800 physical qubits, for 
a rate of 0.5. The checks all have degree 30 and the average 
degree of the qubits is 15. The maximum number of iterations 
is 90, the number of iterations between each perturbations is 
Tp er t — 6, and the strength of the random perturbation [c.f. 
Eqs. (UMIHJ] is 5 = 0.1. 

the simple random perturbation (Sec. IV Bj) . Our results 
obtained for bicycle codes are shown at Fig. [3] Both 
symmetry breaking techniques yield a 10 dB gain over 
the basic BP decoder at depolarizing strength 0.01 for 
the bicycle codes. In particular, the bicycle code has a 
decoding error below 10 -4 around e 0.01 using the ba- 
sic BP decoder and this value is increased to e ~ 0.014 
using our heuristic techniques (c.f. Fig. 11 of Ref. [29l|). 
Results obtained from larger blocks exhibit overall better 
performances, but the improvement due to the modified 
BP decoding are more pronounced on smaller blocks for 
the range of error probabilities accessible through Monte 
Carlo simulations. 

Note that there are two distinct types of failed error 
corrections, depending on whether EE BP G N(S) or not. 
In the former case, the error is undetected because it cor- 



responds to a codeword. These errors can generally be 
attributed to fundamental limitations of the code. In 
the latter case, it is known that the decoder has failed 
because the syndrome after the correction procedure re- 
mains non-trivial. Such a detected error indicates a fail- 
ure of the decoder. Despite the improvements yielded by 
our heuristic methods, in all the cases we have investi- 
gated, 100% of the errors were detected errors. This in- 
dicates that the major obstacle to good quantum coding 
schemes remains in the decoding. The only exceptions 
were the codes presented in Q , that we found to contain 
many weight-6 codewords. 

VII. CONCLUSION 

In this paper, we have explained how the high degen- 
eracy of sparse quantum codes will typically impair their 
performance under belief propagation decoding. Based 
on a simple model, we have proposed some heuristic tech- 
niques to partially overcome this problem. Our numeri- 
cal results show that these techniques provide a clear and 
substantial improvement of the coding scheme's perfor- 
mance, which indirectly corroborates our model. 

Despite these efforts, we found that the greatest chal- 
lenge still remains in the decoding. Indeed, all the errors 
found in our simulations could be attributed to the de- 
coder rather than the finite minimal distance of the code. 
Thus, it is necessary to make further progress in the pro- 
posed decoding techniques for their broad application. 
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